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Abstract 

Multicriteria decision analysis aims at supporting a person facing a de¬ 
cision problem involving conflicting criteria. We consider an additive util¬ 
ity model which provides robust conclusions based on preferences elicited 
from the decision maker. The recommendations based on these robust 
conclusions are even more convincing if they are complemented by expla¬ 
nations. We propose a general scheme, based on sequence of preference 
swaps, in which explanations can be computed. We show first that the 
length of explanations can be unbounded in the general case. However, in 
the case of binary reference scales, this length is bounded and we provide 
an algorithm to compute the corresponding explanation. 


1 Introduction 

A multi-criteria problem consists in formalizing the problem and eliciting the 
preferences of the decision maker (DM). In many decision contexts providing 
recommendations based only on the elicited preference model is insufficient. In 
fact, decision makers may want explanations which justify in a convincing way 
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such recommendations. Indeed, justifying and explaining a rationale for a de¬ 
cision is almost as important as the recommendation itself. This is particularly 
true in situations where the decision needs to be justified to some other stake¬ 
holders (who did not participate to the decision process). In this case, it is not 
satisfactory to just present the preference model and the resulting recommen¬ 
dation. Although technically this model does contain all the information on 
which such a recommendation is based, the format is unlikely to be suitable for 
presentation. Hence, the need for a synthetic and short explanation. Building 
a convincing explanation is also required when the DM cannot be assumed to 
have any mathematical background, as in the case of online recommender sys¬ 
tems, where it has been shown that explanations improve the acceptability of 
the recommended choice |PC07j . 

The problem of constructing or providing convincing explanations in order to 
justify recommended decisions has a rather long history in Artificial Intelligence 
(see for instance |CM061 IKle941 ISNM091 IQPFP07] !. In a nutshell, the idea 
is to provide supporting evidence that a recommendation is justified. This 
evidence may emphasize some critical data used for the recommendation, and/or 
provide a simplified version of the process which lead to the recommendation 
[ iHKROOl FZlll . However, the problem is especially difficult in the context of 
multiple criteria models ILablll 1LMQ12) . where different criteria are at stake. 

In this paper we shall concentrate on the simple additive utility model. This 
well-known model assumes independence among criteria, although of course 
different criteria may have different weights. In other words, no synergy (ei¬ 
ther positive or negative) occurs between the different criteria. An example of 
an elicitation method that relies on such a model, is the even swaps method 
IHKR981 . It aims at identifying, between two options x and y 1 which one is 
preferred to another one without explicitly constructing the utility functions. 
This is basically an elimination process based on trade-offs between pairs of 
attributes (hence the name even swaps). Broadly speaking, in such a swap, the 
DM changes the consequence (or score) of an alternative on one attribute, and 
compensates this change with one another attribute, so that the new alterna¬ 
tive is equally preferred in the end. By replacing one option (say x) with a 
different but equally preferred one, the hope is that dominance will occur over 
y. The process is thus repeated until dominance can be shown to hold, allowing 
to progressively eliminate attributes. 

The UTA (Additive UTility) method is another way to elicit the preference 
of the decision maker and to construct a recommendation |.TS82j . The DM is 
supposed to provide a set of comparison of alternatives. The advantage of this 
approach over the even swaps method is that the required preferential informa¬ 
tion is simpler from the DM point of view. Trade-offs are indeed very complex 
to provide. In fact, as it was pointed in IHKR98I : “Making wise trade-offs is 
one of the most important and difficult challenges in decision making”. On 
the other hand, the main drawback of the UTA method is that the preferential 
information brought by the DM is not sufficient to uniquely specify the util¬ 
ity functions (utility functions are only partially known), and UTA does not 
account for the multiplicity of the compatible utility. More precisely, this fact 
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is dealt by choosing an arbitrary completion. As a matter of fact, a conserva¬ 
tive approach consists in relying on a robust (or necessary) preference relation 
(GMS081 IGSFMlOl . In words, the relation holds if any possible completion of 
the available preferential information yields the preferential statement. 

The aim of the paper is to explain a robust preference relation. When utilities 
are known, one can interpret them in terms of the importance of criteria and the 
satisfaction level of criteria |Kle941 [CM06, La bll] . This approach is not possible 
with the robust preference relation as the robust preference of an option x over 
another one y is complex. The idea is to decompose a robust preference into 
several simpler recommendations, as it is the case in the even swaps approach. 

However, the even swaps approach suffers from a limitation: by requiring 
each new generated option to be equally preferred to the initial one, this makes 
the technique poorly adapted to the context of incomplete preferences where 
such an equivalence virtually never hold. The generalization of even swaps to 
robust relations is thus called preference swaps. A very interesting property 
of even-swaps (for explanations) to justify a recommendation is that it does 
not require explicitly the value of utility function or the model used to get the 
solution. 

Therefore, we construct a sequence of alternatives in which the first option 
is x and the last one is y, and there is a robust preference between any two 
successive elements in the sequence. The existence of such a sequence entails 
the necessary preference of x over y, by transitivity of the robust preference 
relation. The options in the sequence are constructed in such a way that each 
comparison in the sequence is simple to understand for the DM. This is in 
particular the case if two successive alternatives in the sequence are similar on 
most of criteria and differ only on a few criteria. This sequence of alternatives 
provides an explanation of the robust preference of x over y. 

The remainder of the paper is as follows. Section [2] describes the background 
notions and concepts that we shall use in formulating the explanation. Section[3] 
presents new results concerning the necessary preference relation. These results 
are used to derive properties of the explanation engine introduced in this paper. 
This engine is described and discussed in Section [U in terms of some technical 
characteristics that we believe important to deal with the question of construct¬ 
ing a formal explanation. Finally, in Section [5] we address the case of binary 
preference scales, and show the properties of explanations when preferences are 
expressed in such scales. 

2 Background Notions and Literature review 

We consider a finite set N = {l,...,n} of criteria. Each criterion i € TV is 
described by an attribute Xi. We assume that all attributes are numerical 
and increasing. For discrete attributes, Xi represents integers. For continuous 
attributes, Xi is an interval (possibly unbounded). Alternatives are considered 
as elements of the Cartesian product of the attributes: X = Xi x ■■■ x X n . 

In this section we recall the principles of the standard additive utility model, 
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see section EU Then section EZH describes shortly the even swaps approach. 
Lastly section 12.31 explains the robust preference relation. Moreover, in order 
to get an understanding of the notions and different proposals of this paper we 
shall use the Example |f] The idea is to provide, at the end, an explanation for 
why an option is the best choice for the case considered. 

Example 1 You need to rent an office for your business and your are unde¬ 
cided between two options, namely: x and y. Such options are evaluated ac¬ 
cording to four criteria: { 1: the Commute time (min); 2: the availability of a 
Sport club (gym, no gym); 3: the Size(mr), and 4: the Cost(€) }. Of course you 
want to minimize the cost and commute time, while you seek to maximize the 
availability of a sport club and the size. 

The criteria are described respectively by the following increasing attributes: 
A'i c R _ ,A 2 = {Yes,No},X 3 c M. + ,andX 4 , c IT. Thus, the evaluation of the 
previous options is as follows: 

x = (-45, no gym, 450,-5000) and y = (-15, gym, 180,-12500) 


2.1 Utility based preference model 

According to the additive utility model the comparison of two multi-attribute 
alternatives x = (aq, X 2 , •••, x n ) and y - (yi,y 2 , m ",y n ) is given by: 

x ~y ** XI u i( x i) ^ X! u i(yi) 

izN ieN 

where the utility functions Ui : Xi -*■ R are supposed to be known (and already 
elicited). This model entails separability, expressed by the ceteris paribus prin¬ 
ciple: when comparing two options, the criteria where the options have equal 
attributes do not count and can be merely ignored. In this paper, the wildcard 
“*” will denote the common evaluation when comparing two alternatives. For 
instance (-15, no gym, 200, *) >; (-50, gym, 200, *) means that the preference 
holds whatever the common evaluation on the fourth attribute. 

A popular method to construct an additive utility model is the UTA (UTility 
Additive) approach [JS82I . In this method, a subset V) of Aj is chosen for each 
attribute. The elements of Vi are denoted by V. = {vip < < ■■ ■ < Vi jPi } c Aj, 
where pi = |Vj|- For instance, as it is described in the Example El for the first 
attribute, Commute time, V\ - {a := -50 < A := -15}. 

We set V = V\ x... x V n . The unknowns of the decision model are the values of 
utility functions at the points in Vix---xV n , i.e. the values (Mj(uj l i), ..., Uj(i>j lPi ))j € jv- 
Given two alternatives x, y e A', the query “is x preferred to y ?” is de¬ 
noted (x >? y), and will be given a truth value by determining the pair (x,yfs 
membership of various preference relations over A', which are partial preorders : 
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reflexive and transitive binary relations, but not necessarily antisymmetric, and 
generally not complete^- For the moment, we consider two preference relations: 

• V: the Pareto dominance relation (in Example[2] V = {(e 2 £ 63 )} ); 

• V: the preferential information explicitly given by the decision maker, 
leveraged as a learning set (in Example [2] V = {(ei >; e2);(e3 > e 4 ), (e 4 > 
es)}). In the UTA method, the preferential information V is composed 
of preference statements of the form x > y, meaning that x is at least as 
good as y regarding the concerns of the DM. 

Example 2 (Ex\7 J Ctd.) 

Our aim in this paper is to build explanations, considering the case of binary 
reference scales (see Section 0). To stay within this framework and to demon¬ 
strate the feasibility of our proposal, we adapt the Example Q] to this context. 
Therefore, the preference information is collected using alternatives on binary 
reference scales. Thus, according to the Table 0 the preferential information 
V = {(ei £ e 2 ); (e 3 > e 4 ), (e 4 > e 5 )}. 


Criterion: i 

Commute time 

Sport club 

Size 

Cost 

Top level: 

A:=-15 

B:= gym 

C:= 400 

D:= -5000 

Bottom level: Vi \ 

a:= -50 

b:=no gym 

c :=200 

d:= -12000 


Table 1: Criteria scales 



Commute time 

Sport club 

Size 

Cost 

ei 

-15 

no gym 

400 

-12000 

e-2 

-50 

gym 

200 

-5000 

e-3 

-50 

no gym 

200 

-5000 

e 4 

-50 

gym 

400 

-12000 

e-b 

-15 

no gym 

200 

-12000 


Table 2: Evaluation of the learning set 


2.2 The even swaps method 

The even swaps method |HKR98j relies on an additive utility function. In order 
to choose the best alternative, this method does not require to fully elicit the 
marginal utility functions, but only a limited number of trade-offs between pairs 
of attributes (swaps). In other words, the DM does not have to explicitly define 
the preferences over the attributes in general or to make any assumption about 

1 A related question is to count the number of queries to come up with a complete linear 
order [FPR02] 
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the form of the utility function. More precisely, the decision maker changes the 
consequence (or score) of an alternative on one attribute, and compensates this 
change with a preferentially equal change on another attribute. This creates a 
new fictitious alternative, that is indifferent to the previous one, with revised 
consequences. We use this alternative to try to eliminate the other ones. The 
aim of this process is to carry out even swaps that make either alternatives 
dominated or attributes irrelevant. 

Intuitively, this method can be seen as a scattered exploration of the iso¬ 
preference curve (the curve where lies, even virtually, the alternatives equally 
preferred) of the DM. This constructive method is quite intuitive as only two at¬ 
tributes are involved in even swaps, and utilities are never explicitly mentioned 
to the DM. Moreover, the existence of indifference statements required by the 
use of the even swaps method is guaranteed by the solvability condition used in 
conjoint measurement [KLST711 chapter 6]. 

We can also note that |MH071 lMH05j propose to enrich the original even 
swaps method in a way that accounts for incomplete knowledge about the value 
function. They consider a “practical dominance” notion when the value of an 
alternative is at least as high as the value of another one with every feasible com¬ 
bination of parameters, this perspective being very close to the one developed in 
[ IGMS081 (see next section). However, this notion is only used for pre-processing 
dominated alternatives, and not integrated in the swap process, let alone used 
for explanatory purposes. 

2.3 Robust relation with the additive utility Model 

As already mentioned, the main drawback of the UTA method is that the pref¬ 
erential information brought by the decision maker is not sufficient to uniquely 
specify the utility functions. A conservative approach consists in relying on a 
robust (or necessary) preference relation [GMS081 IGSFMlOj . In words, the re¬ 
lation holds if any possible completion of the available preferential information 
yields the preferential statement. 

Definition 1 (Necessary Preference Relation [ GMS08, IGSFMlOj I 

x £ y iff XI Ui(Xi) > XI u i{yi) f° r al1 U i S± 

ieN ieN 

• Ui is a monotonically increasing function A) -» R, 

• Yj u i( a i) * X u i( b i) V[a>b]eV. 

ieN ieN 

• u i (s7 wn )>Vi e IV, X Ui{x™ ax ) = 1 where x™' m (x™' ax resp.) represent the 

ieN 

minimum (maximum, respectively) envaluation in Xf. 

Note that, once the DM has provided his preferential information V, the 
reference scales Vi are simply taken as the values that the alternatives in the 
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Figure 1: Necessary Preference relations 


preferential information use: 

Vi = {{a,i,bi} , (a > b) e V}. (1) 

According to Definition [ 1 ] we introduce a new relation: A/”, the necessary 
preference relation that can be deduced from V under the assumption of an 
additive utility model (see the following section for more details). N contains 
VfD and other pairs ( x,y ) such that x is necessarily preferred to y, which can 
be expressed by the statement (x > y ). For example, the graph of the Figure [T| 
represents the necessary relation Af build according to the Example [ 2 ] 
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3 Characterization of the robust value based pref¬ 
erence relation 

Our aim through this paper is to provide a solid mechanism to construct expla¬ 
nations for necessary preference relations. Before a new explanation framework 
is introduced and discussed in Section [IJ this section presents some new algo¬ 
rithmic results concerning the necessary preference relation. Such results are 
important to establish some properties of the explanation engine detailed in 
Section [5] 

3.1 Rounding queries to the reference scales 

Definition Q] enables to express queries where alternatives are taken in the whole 
evaluation space X = Xi x X 2 x ... x X n . However, on each criterion i e TV, 
preferential information V is expressed on a subset of Xj, the reference scale Vi 
(see (HI). 

When Xi ,yi e Vi for all* e TV the pessimistic evaluation for all monotonically 
increasing functions can be written as a minimization problem, where the ob¬ 
jective function and the constraints are linear functions of the Y pi variables 

ieN 

Ui(v itk ),ieN,l<k< Pi lOMSOSllOSFMIO] : 

(x >■? y) e JV 

t 

min Y Ui(xi) - Uifyi) > 0 (2) 

ieN 

( Ui{v i}k ) - Ui(v i: k’) >0 VieiV, Vk,k' , 1 < k' < k < Pi 

S -M Y u i( a i) ~ u i(bi) ^0 V(a>6)e V 

l izN 

In order to account for values Xi,yt outside V), previous studies IGMS081 
IGSFMIO] augment the reference scales Vi := Vi u thus allowing the 

resolution space to evolve dynamically according to the input alternative. We 
propose a characterization of the necessary relation as linear programs with vari¬ 
ables and constraints defined statically by the preferential information V. Alter¬ 
natives evaluated outside the reference scales are preprocessed and fitted onto 
V using a conservative rule, directly accounting for the worst case under mono¬ 
tonicity constraints. This fitting proceeds on each criterion i e N separately, 
as permitted by the additive utility model. On criterion i, the values of V); are 
sorted in ascending order, inserting Xi and yi into the scale 1 < Vi t 2 < ■ ■ ■ < v lJ}i . 
Three exclusive cases arise, as it is illustrated in the Figure [2]: 

• If yi is strictly at the top of the scale, and/or Xi is strictly at the bottom of 
the scale (i.e. Xi < yi and [ yi > Vi tPi or Xi < uqi]), then necessary preference 
of x over y is impossible, and we say the query (x >? y) is unbounded by 
V (see case (a) of Figure 0. 








• If the pair yi < Xi is adjacent (i.e. yi < Xi and $k yi < Vi t k < x*), then 
criterion i is neutral in the eventual necessary preference of x over y. The 
specific value of attributes Xi , y.-, does not matter, and can be replaced by 
the wildcard * (see criterion k in case (b) of Figure [2j). 

• Otherwise, there are indexes k,k' such that < x,; and yi < v.;^/ are 
adjacent pairs. We will show that the eventual necessary preference of 
x over y can be examined by considering the reference values (t-y^/ty*,') 
instead of (xi,yi). In this substitution, the candidate to preference x is 
rounded down to the next lower value in Vj, while the challenger y is 
rounded up (see criterion i and j in case (b) of Figure [2]). 

These cases are formalized by the following Definitions [2] and [3] 



criterion l criterion i criterion j criterion fc 

(a) Unbounded case (b) Bounded case 

Figure 2: Rounding of a query (x >■? y), where • and ■ represent respectively 
the value of option x and of option y. 


Definition 2 (bounded and unbounded preference queries) 

Let x, y e X, we say the preference query (x >? y) is unbounded by V if 3 i e 
N, yi > Xi and (yi > Vi Pi or Xi < Vn). Otherwise, the query (x >? y) is bounded 
by V ■ 


Definition 3 (Rounding of bounded preference queries) 

Let ( x >■> y) a bounded query. It is mapped to the rounded query, [x >■? y] := 
(x >■? y), where alternatives (x, y) are taken in the reference scale V and defined 
jointly as follows : 

w. N , -\ = f (*, *) , if Xi > yi and $k , < v iik < x t 

1 A-LuVi) i ( max {t) g Vi,v < Xi},min{v e > yi}), else. 


Note that queries expressed on alternatives taken inside the reference scale 
V are left unmodified when rounded. 
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Theorem 1 (Reduction to the reference scale) 

Let x,y 6 X. x is necessarily preferred to y iff the query (x >■? y) is bounded by 
V and [x >? y] e A/”. 

Proof. If the query (x >? y) is unbounded by V, then the LP © is unbounded, 
as the difference Ui{yf) - Ui{xf) is left unbounded by the constraints, and can 
be made as large as needed to ensure Ez<eat u i{yi) > T,kn u i( x i)- Thus, the LP 
is unfeasible and (x >? y) $ M. 

Else, suppose x is necessarily preferred to y. The proxy alternatives x,y are 
so defined that x dominates x and y is dominated by y. Hence, transitivity of 
the relation M ensures that x is necessarily preferred to y. 

Reciprocally, suppose x is not necessarily preferred to y : there is a vector 
u = (v,i : Vi -*■ R)i e Ar of increasing functions compatible with the preferences V 
ranking x lower than y. If required, these functions ( m ) can be extended, by 
specifying the images of ( Xi ) and/or (jjf) when they are not in the reference 
scales {Vi), into another vector {uf) of increasing functions compatible with 
the preferences V such that u{x) = u(x) < u(y) = u(y), i.e. x is not necessarily 
preferred to y. If {x^yf) * (*, *), let u^xf) := Ui{xf) and u.^yf) := u^yf). These 
definitions enforce the monotonicity of u t . If {x^yf) = (*,*), then Ui{xi) = 
Ui{yi) can be assigned any value preserving the monotonicity of u. 

□ 


3.2 Covector associated with a query 

The characterization of necessary preferences can be further streamlined. The 
objective function and the constraints can be expressed as functions of the 
elementary preferences Wi t k : = Ui(vi : k+i) - Ui(vi t k) indexed by I = {(i,k),i 6 
N,l<k< pi} : 


Vi e N , Vf € V) , Ui(t) =u i (vi ! i)+ Y, 

1 < k < pi 
Vi,k+1 < t 

= Ui(Vi, l)+ Y 

1 < k < pi 
Vi,k +1 ^ t 

Hence, 


Vi e N , Vxi,yi eVi , Ui(xi)-Ui(yi) = Y Y w i,k 

1 < k < pi 1 < k < pi 

Vi,k+1 < xi v it k +1 < Vi 

Consider the contribution of the elementary preference w^k to the right 
hand side : w t ,k contributes in favor of alternative x with coefficient +1 iff 
Xi > Vi t k +1 > Vi : k > Vi, in favor of y with coefficient -1 iff yt > Vi t k+i > Vi t k > xi, and 


(Ui(Vi t k+ 1 ) - Ui{v it k)) 

W iy k 
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does not contribute (with coefficient 0) otherwise. Consequently, we introduce 
the covector form of a query (for illustration see Figure [3]): 

Definition 4 (Covector associated to a preference query) 

fox-fo {-1,0,l} 7 

(x>?y) m- ( x>?y )* 

( +1 if Xi > Vi t k+ i > Vi : k > yi (wi t k is an argument supporting x) 

-1 if yi > Vi,k+i > Vi,k > xi (wi, k is an argument supporting y ) 

0 else {wi t k is a neutral argument ) 

The set of covectors associated to a dominance statement is {0, l} 7 . A 
dominance covector can be broken down as a sum of elementary dominance 
covectors d* k for (i,fc) e / : (d* fe )i',fc' = 1 if i = i' and k = k r , 0 otherwise. 
Actually, (d*) = ((1 ,0,..., 0), (0,1,0,..., 0),..., (0,..., 0,1)) is the canonical 
covector base. 

The introduction of covectors yields to a concise formulation : 

Vx,y € V , Y, u i( X i) - u i(yi) = Z ( x y)i,k w i,k 

ieAT ( i,k)el 

Finally, using the product notation between covectors and vectors of M 7 to 
omit indexes : 


Vx,y € V , u(x) - u(y) = (x >? y)* vj 

Consequently, the following theorem links the transitivity of the relation M 
and the sum of covectors. 

Theorem 2 (Chasles relation for covectors) 

Let x,y,z €V 

(x >? y)* + ( y >? z)* = ( x >? z)* 


3.3 Characterizations of the necessary preference relation 

In this subsection, we provide three equivalent linear programming formulations 
permitting to answer a necessary preference query. The first one is expressed 
as a condition on the elementary preferences, while the last two check a linear 
dependence relation between linear forms representing preferences. 

Theorem 3 (Primal atomic characterization of necessary preference) 


x is necessarily preferred to y 
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0 


Wi, 3 


Wi 2 


VJiA 


Wi,2 


+ 1 


-1 W k , 2 


-1 W k , 1 


criterion i criterion j criterion fc 

Figure 3: Elementary preferences w and associated covector coefficients, where 
• and ■ represent respectively the values x and y on the criteria 


ft 

( x >? y ) is bounded by V and [x >? y]* w > 0 for all w 6 s.t. 

{ Wi t k>0 y(i,k)el (monotonicity) 

(a>b)*w> 0 y(a>b)e V. 

Theorem [3] references [x >? y]*, the covector associated to a bounded query 
after it has been rounded to the reference space V. Such a covector can be com¬ 
puted in two phases, going successively through definitions [3] and [4] However, 
this process can be shortened. For criterion i € N: 

• If Xi > yi, only the intervals [vi,k,Vi,k+ i ] fully included in the asset [a 'i,yf\ 
are taken into account positively. Other intervals are neutral. 

• If yi > Xi , all the intervals [vi,k,Vi,k+i] necessary to cover the liability 
are taken into account negatively. Other intervals are neutral. 
There is a possibility for insufficient coverage, when (x >? y) is unbounded 
by V. 

Theorem 4 (Covector associated to a rounded query) 

Let (x >■? y) a preference query bounded by V . The coefficients of the covector 
[,x >■? y ]* representing its associated rounded query are given by : 

( +1 if [vi,k,Vi, k +i] <= [yi,Xi] 

[x £? y]* itk = t - 1 if[vi,k,Vi,k+i]n]xi,yi [*0 

[ 0 else 
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Establishing the preference of x over y can thus be seen as covering the 
liabilities of y by the assets of x. We call this characterization atomic, as it 
proceeds without splitting the fundamental resources ]'U»,fc,Uj,fc+i[. When such 
a resource would be split, it retains its full cost Wi t k as a liability but loses ah 
its buying power as an asset. When the whole range is not enough 

to meet the needed cost, preference is impossible. 

Theorem 5 (Dual characterization of necessary preference, LP form) 


x is necessarily preferred to y 

t 

the query (x >■? y) is bounded by V and there are real nonnegative coefficients 
(\)v and (p)j such that : 

[x>iy]*= Y A a,b(a>b)*+ Y Vi,kdlk 

(a>b)€ V ( i,k)el 

where the ( d*)i are the elementary dominance covectors. 

Proof. Thanks to Theorem [3J the necessary preference of x over y is charac¬ 
terized by the inconsistency of the linear system of inequations in the variable 
w € R 1 : 

f ~[x &? y]* w > 0 

-j d* k w> 0 V(i, k) e I 

( (a>b)* w> 0 V(a>6)e V. 

According to Farkas’s lemma [Man69l . this inconsistency is equivalent to the 
linear dependance of the covectors written in the theorem. 

□ 

This LP formulation is used in Algorithm [1] : FindExplanation described 
in Section [5j 

Following Definition [T| the necessary preference relation could appear as 
a black box, as it interprets a minimization over a functional space. From 
theorem O this relation corresponds to the conical combination of the assertions 
expressed in V and of the elementary assertions derived from monotonicity 
(see also |ST14] h We believe that actually strengthens the importance of the 
necessary preference relation, as one of the most fundamental tools to explore 
preference. 

The linear combination can be constrained to use rational numbers only, 
yielding to an ILP formulation : 

Theorem 6 (dual characterization of necessary preference, ILP form) 


x is necessarily preferred to y 
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ft 

the query (x >? y ) is bounded by V and there is a positive integer r and non 
negative integers (tj-p and (to)/ such that : 

r[x >7 y]* = Y £a,b(a>by+ Y m i.kd*^ k 

( a>b)eV (i,k)el 


where the ( d*)i are the elementary dominance covectors. 

Proof. The coefficients (A) and (//) of theorem [5] are solutions of an affine sys¬ 
tem of equations with coefficents in {-1,0,1 }. The solution set of this system is 
non-empty in M iff it is non empty in Q. Thus, let A = £ and /i = —, fractions of 
integers with common denominator r. Multiplying by r the relation with A, /i 
yields the relation sought with r, £, m. □ 

Note that the characterization proposed in the previous results is different 
from the characterization results shown for the additive utility model in conjoint 
measurement |KLST7fl chapter 9]. In Theorem [6l we provide necessary and 
sufficient condition for a given alternative x to be necessarily preferred to an 
alternative y, while axiomatic results in decision theory are interested in finding 
necessary and sufficient conditions on the preference relation to be representable 
by a given model, where these conditions take the form of properties that the 
relation shall satisfy. For instance the additive utility model is characterized by 
Archimedean property, double cancellation and independence, if the attributes 
are continuous IKI.ST71 , chapter 6]. However, this characterization does not 
apply to the necessary preference relation. 

Obviously, Theorem [5] is more convenient than Theorem [S] for implementa¬ 
tion purpose. Nevertheless, we believe the characterization given by theorem [G] 
gives an even deeper insight into the necessary preference relation. Indeed, the 
query appears to be a linear combination with integer coefficients of the pref¬ 
erential information and of dominance. This ILP form is leveraged to prove, in 
Section 0 a result on the length of the explanation in a particular case - see 
Theorem [8] (the proof is presented in EJ. 

4 Explanation Engine 

In this section, we present an original approach to explain a robust preference. 
We recall that our decision model is based on a robust additive utility model 
(see Section [2j). A straightforward way of providing explanation would use the 
computed utility functions and exhibit them to the decision maker. The diffi¬ 
culty of this approach is twofold. Firstly, utilities are basically non normalized 
as they return values in different scales, thus it is not always easy to interpret 
them. One easy way to solve this issue is to impose that the extreme values 
on each attribute are commensurate , this is the fact that valuations on dif¬ 
ferent criteria can be compared. More precisely, criteria are supposed to be 
not satisfied at all for every minimum element of the attributes, and criteria 
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are supposed to be completely satisfied for every maximum element of the at¬ 
tributes. This amounts to transform the additive utility model into a weighted 
sum, where the utilities are normalized and a weight is assigned to each criterion. 
One can then interpret the overall score in terms of the importance of criteria 
and the degree of satisfaction of criteria [Kle94l ICM061 ILabllj . Once more, 
this conversion requires to add a commensurability assumption, which one may 
not accept. In this latter situation, the utility functions in the additive utility 
model are meaningless to the decision maker. Secondly, the previous process 
is not possible when the utility functions are not precisely known, which is the 
case with the robust preference relation. 

We propose to explore a different venue in order to provide explanations 
easily understandable by the DM : sequences of simple preference statements, 
that can be followed step by step to check the decision process. This section is 
dedicated to detail this notion of explanation. 

4.1 Explanations based on preference swaps 

Our idea of explanation is somewhat reminiscent of the even swaps method (see 
Section 12.21) which is an interactive process aimed at building a sequence of 
equally preferred (even) alternatives, where only two attributes are modified at 
each step (swaps), initiated by the preferred term of the query and the final term 
of the sequence dominates the less preferred term of the query. This constructive 
method is quite intuitive as only two attributes are involved in each even swap, 
and utilities are never explicitly mentioned to the decision maker. In fact, if 
we consider the example displayed in IHKR981 , we can easily observe that from 
the reasoning steps of the even swaps process we can deduce an intuitive and 
simple manner to explain the result to the decision maker, without referring 
explicitly to the utility function or the model used to get the solution. We keep 
the idea of explanations made of an easy to follow sequence of alternatives, but 
we have to take into account several obstacles to import the even swap process 
wholesale. While the even swap method involves an interactive process aimed 
at choosing the best alternative through a sequence of indifference statements 
(and deletion of non discriminating criteria), our decision model is more cautious 
(robust and non-parametric), and we aim at providing an explanation without 
further interaction with the decision maker. In this context of deliberately 
frugal information, exploring an isopreference curve of the decision maker by 
asking equivalence queries is clearly not on the table. Therefore, we relax the 
notion of equal preference, and only require the sequence of alternatives to be 
of decreasing preference, thus introducing the notion of preference swap. 

Definition 5 (Preference swap) 

When (x > y) e J\f \ V, we say that x > y is a preference-swap of order p 
with p := |{/ € N,Xi * yi} | e {2 ,... ,n}. 


As preference swaps exclude the dominance, x is preferred to y on at least one 
attribute, and y is preferred to x on at least one attribute. We will consider the 
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set A p containing the pairs of alternatives ( x , y ) such that x> y is a preference 
swap of order p, for p e {2,... ,n}. Clearly, A/ - is partitioned between the sets 
2?, A 2 ,..., A„, whereas the preferential information T 5 may contain preference 
swaps on any order. We believe the order of a preference swap gives an indication 
about the cognitive complexity of a statement : when alternatives x and y differ 
on many attributes, the statement (x > y) is hard to understand. 

From now on, we define an explanation as a chain of sequences such that 
each sequence x l > x l+1 is either a preference swap of at most a given order or a 
dominance relation ( V ) between the two alternatives. More precisely, this type 
of explanation transforms one single comparison x > y that the DM needs to 
understand by a sequence of several preferences x l > x 1+1 . The idea is that the 
initial preference x > y is complex to understand as the values of x and y differ 
on most (if not all) attributes, whereas each intermediate comparison x l > x l+1 
is much easier to understand as x l and x l+1 differ only on a few attributes. In 
other terms, if x l and x l+1 have the same value on all attributes except on a few 
ones, then the preference will be apparent for the DM. We see that the order of 
the preference swap helps to better understand the explanation. Thus, we can 
define an explanation as follows: 

Definition 6 (Explanation^k) 

An explanation of length q of the necessary relation x > y is a sequence 
(a; 1 >; x 2 )... (x 9_1 > x q ) of dominance relation and/or preference swaps of order 
at most k (V u A 2 u ••• u Ak) such that x 1 - x and x q = y and x 2 ,..., x q ~ x 6 X. 

According to Definion [6] the set of statements that admit an explanation in 
V u A 2 u • • • u Afc is noted £k- 

Remark 1 We note that if the definition of explainability hy dominance rela¬ 
tions and preference swaps of order at most k allows the intermediate alterna¬ 
tives x 2 > ... > x q_1 to be taken anywhere in space X, these alternatives can be 
restricted to the augmented reference scales V without loss of generality. 

4.2 Low order preference swaps 

Even though we have defined £k for a general index fc, we will consider from now 
on only the case k = 2. The reason is that preference swaps of order 2 are much 
simpler to understand for the decision maker. (However, one may use £k, with 
k > 2, if there is no explanation in £2 of a given preference in x> y. Obviously, 
£2 £ A/ - ). 

The concept of swaps between two attributes is also known in engineering. 
For instance, the Architecture Tradeoff Analysis Method (ATAM) is used in 
order to assess software architectures according to “quality attribute goals” 
(KKCOOI . A trade-off point is an architecture parameter affecting at least two 
quality attributes in different directions. For example, increasing the speed of 
the communication channel improves throughput in the system but reduces its 
reliability. Thus the speed of that channel is a trade-off point. The concept of 
trade-off point in ATAM makes explicit the interdependencies between pairs of 
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attributes. The decision maker can express preferences among trade-offs from 
these dependencies. 

From Definition [S] (with k = 2), it is important to observe that technically an 
explanation is a path from x to y describing a decreasing sequence of preferences 
in the directed graph G := (V,T> u A 2 ), such that the vertices are in V (which 
has an exponential size with respect to the number of criteria) and the edges are 
preference swaps of order 2 or dominance relation (for more details see Section 
0 . In what follows, we shall see that we can face some technical problems in 
order to find such a path. 

4.3 Discussion on some technical challenges with expla¬ 
nation 

In this work we consider that the basic building blocks of an explanation is 
a preference swap of order 2. Before going any further on how to construct 
such an explanation, we shall discuss some points that we believe important 
for the definition of an explanation. In fact, a primary question regarding an 
explanation is to be able to say if it is satisfactory or not for the decision maker. 
To account for that, we shall discuss several points related to an explanation, 
namely: existence, length, values of the attributes involved in the sequences and 
position of the different swaps. 

These four points are important both from a theoretical point, a practical 
point of view and an implementation or operational point of view, as they may 
have an impact on the impression of the decision maker on the explanation. In 
order to construct a convincing explanation, the framework should be at the 
same time theoretically rigorous and pertinent from a practical (implementa¬ 
tion) perspective too. We draw the attention of the reader on the fact that it 
is outside the scope of the paper to resolve all the points discussed below. Our 
aim is to highlight some of the complexities around the question of constructing 
an explanation. 

• Existence of an explanation 

The first point to consider in the construction of an explanation is to make 
sure there is one to be found. Without any additional assumption, it is 
quite possible that £2 £ A/”, that there are some statements that cannot be 
explained by preference swaps of order 2 and dominance relations. 

For instance, in the context of Example[2J we have ( A, B , C, d) > (a, b , c, D) e 
M \£2 (see Example [S] in Section IOI) . 

Is it possible, by imposing conditions on the preferential infor¬ 
mation V, to ensure the existence of an explanation from every 
statement in Ml 

Technically, checking if we can explain a statement x > y in £ 2 , can be seen 
as determining if the vertices x and y are connected in the directed graph 
G := (V,T> u A 2 ). Of course, we have efficient algorithms in order to test 
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if a graph is connected or not |EvenT75] . However, it may be challenging 
to use them with regard to the size of the graph in our context. 

In Section [5j we come up with answers to these challenging questions, 
under the assumption of binary reference scale. In this specific case, the 
existence of an explanation is guaranteed when preferences from V only 
refer to swaps of order 2 , and can be efficiently checked otherwise. 

• Length of an explanation 

A second point that we address here is the length q of the sequence. 
Indeed, keeping the explanation short has a great bearing on its ability to 
convince. Even if each elementary comparison x 1 > x z+1 is trivial for the 
decision maker, the overall sequence x > x 2 > ... > x q_1 > y cannot be seen 
as a convincing explanation if its size exceeds a given value. One then 
looks for explanations with the smallest possible size. 

Finding the shortest explanation means resolving the problem of shortest 
path in the directed graph G := (V,2?u A 2 ). Thus, the length of a shortest 
explanation is bounded by the diamete of the graph G. Finding such a 
diameter is a classical problem in graph theory for which we have polyno¬ 
mial algorithm in terms of number of vertices and edges (see for instance 
IACM96I ) . However, it can be too slow to be practical in our case with 
\V\ vertices (exponential with respect to the number of criteria). 

In what follows, we come up with two results concerning the length of 
the explanations provided by our framework. First, without additional 
assumptions, the length of the explanation is unbounded, as soon as there 
are 3 criteria or more. 

Theorem 7 (Unbounded length of shortest explanations £ 2 ) 

Consider n - 3. For every peN*, if X 1 2 {0,1,2,..., 2 p}, X 2 2 {-p, -p + 
1 ,..., - 1 , 0 } and X 3 2 {-p, -p+ 1 , ...,- 1 , 0 }, then there exists some prefer¬ 
ential information V and x, y e A' such that x>y and the minimal length 
of the explanation in V u A 2 is at least 2 p. 

The proof of this result can be found in m 

For the sketch of the proof, we construct, for every p , a preference between 
x = (0,0,0) and y = (2 p,-p,-p). Starting from alternative (0,0,0), we 
begin with a preference swap between attributes 1 and 2 (adding value 
1 on the first attribute, and subtracting 1 on the second one). Then we 
perform a preference swap between attributes 1 and 3 (adding value 1 on 
the first attribute, and subtracting 1 on the third one). We proceed then 
again by a preference swap between attributes 1 and 2 , and so on (the 
sequence is depicted in Figure HJ). 

2 The diameter of a graph G is the longest distance between two vertices in graph. 
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Figure 4: Description of the sequence 


Second, under the assumption of binary reference scales , restricting the 
number of values that the preferential information can take on each at¬ 
tribute, we provide a tight upper bound on the length of explanations in 
Section [5] 

• Values of the terms in the sequence 

Another point concerns the choice of the values of the intermediate alter¬ 
natives x 2 ,... , a; 9-1 on the different attributes. In fact, if these values are 
all different and not chosen appropriately, this can induce a cognitive load 
to the DM, when analyzing the sequence x > x 2 > ... > x 9-1 > y. Many dif¬ 
ferent choices of the values of the intermediate alternatives x 2 , ..., x 9-1 on 
the different attributes can be considered. A first option is that the value 
of these alternatives on each attribute can be only the value of x or y. We 
think that this case is suitable if we need to explain only one comparison 
x > y. However, this is not always the case, and one can ask for explana¬ 
tions for several comparisons x > y, x' > y', x" > y", .... At this moment, 
the intermediate alternatives used for the different explanations will be 
quite different using the first option. In the second option, the value of 
these alternatives on each attribute can only take predefined values (where 
the predefined list of values does not depend on x and y). Hence the in¬ 
termediate alternatives contained in the explanation of x > y, x' > y', ... 
use the same values on the attributes, which reduces the workload for the 
DM. Some policy concerning the preferential use of some particular values 
in V, favoring the values V of the preferential information, or the values 
of the attributes of the alternatives being compared, can prove more or 
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less convincing, depending on the context (see Section [5]). 

• Swaps positions 

Finally, another important aspect is the position of the swaps in the se¬ 
quence. In other words, we believe that the ordering of the swaps in the 
explanation may have an impact on the persuasiveness of the explanation. 
Various policies can be considered, for instance Strongest first, where the 
strongest support (attribute) is presented first, in order to get early on 
at least a provisional agreement from the decision maker In this case, we 
should be able to determine which attribute is the most convincing for 
the DM, or Safer first presenting first the attributes that refer directly to 
the preferential information given by the DM, ... This has some connec¬ 
tion with rethoric where an important aspect is to determine the order in 
which the arguments are to be presented to the audience, depending on 
their sign and strength [MG96] . 

Technically, let us consider an explanation as a sequence of swaps, rather 
than a sequence of alternatives. As a preference swap specifies that an 
increase on a given criterion is compensated by a decrease on another 
criterion (according to the preferences of the decision maker), regardless 
of the values of the other attributes, it is quite possible that the same swap 
appears at different places in the graph in order to connect the vertices 
(a; 1 >; y 1 ), ( x 2 > y 2 ),.... A remarkable property of the sequence of swaps 
is the possibility to commute between swaps related to different criteria : 
let the explanation x > y > z with the swaps si = (x > y) and S 2 = (y £ z) 
that are not related to the same criteria, then we also have an explanation 
x > y > z involving S 2 followed by si. This ability to change the order 
of the swaps inside an explanation suggests some freedom to present the 
different arguments in an order that will help to increase the persuasiveness 
of an explanation. Furthermore, when the swaps share a criterion, which 
is increased by one and decreased by another one (for instance, the first 
swap compensates an increase on i by a decrease on j and the second swap 
compensates an increase on j by a decrease on k ), it might be possible 
to trade directly between criteria i and k , without involving j. We apply 
this principle of reduction by transitivity, under the assumption of binary 
reference scales, in Section [5] 


5 Properties of explanations when preferences 
are expressed on binary reference scales 

This section aims at deepening the understanding of the explanation engine 
introduced in section [4l by focusing on the case where preferences only refer 
to two distinct values of the attributes on each scale. We will refer to this 
assumption as binary reference scales. Section 15.11 discusses the meaning and 
relevance of this assumption. Section [521 introduces the partition of criteria into 
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arguments supportive or not of a preference query, and leverages the assumption 
of binary reference scales to simplify indexes, covectors, and rounding of queries 
introduced in section scales. Section T5.31 reveals the core term-by-term structure 
of any explanation, and resulting properties. 

5.1 Binary reference scales 

Binary reference scales are encountered when the preferences V expressed by 
the decision maker only reference two levels on each attribute : Vi € N,Vi = 
{-U < Tj}. Besides luck, such a tight reference set is the consequence of one of 
these two situations : 

• attributes are themselves binary : present or absent features, passed or 
failed checks, etc. Also, such binary attributes may result from any model 
relying on subset comparisons. While they fall outside the scope of this 
article, we believe the explanation engine discussed here can address prob¬ 
lems not necessarily resulting from an additive utility decision model (for 
instance, robust weighted majority decision models rely on subset com¬ 
parisons between coalition of criteria, as do pan-balance comparisons en¬ 
countered in extensive measurement problems). 

• when expressing preference statements, the decision maker is deliberately 
restricted to comparing between prototypical alternatives specifically chosen 
in rii(Ejv{-U, Tj}. This process is supposed to help the decision maker focus 
on the main aspects of the preference problems, by limiting the number 
of moving parts between alternatives, and by referring to carefully chosen 
reference values, serving as anchors. This technique is used in the field 
of experimental design (yielding the one-factor-at-a-time or the factorial 
experiments methods), as well as in multicriteria decision aiding. For 
instance, the MACBETH method |BV95l IBLCB08] is based on binary 
alternatives : in order to assess hidden technical parameters (the weights 
of the various criteria), the decision maker is asked to express preference 
between prototypical alternatives, traditionally referencing a neutral level 
1 i (for technological products, representing the attribute of a mid-range, 
available product), and a high level T, (representing the attribute of a 
luxury product, or a hypothetical performance demanding a technological 
breakthrough). Note that, while MACBETH assumes commensurability 
between the neutral levels ( U)i^N and between the high levels (t i)^N 
(representing the satisfying reference level, in the sense of Simon |Sim56j . 
commensurability is neither needed nor assumed in this article. Not that 
in MACBETH, the two reference are called Low and High and are not 
necessarily assumed to be commensurate. 

5.2 Positive and negative arguments of a statement 

When preferences reference exactly two values on each criterion, i.e. Vi = 
{lj, Tj}, the index set boils down to the criteria set : I = N x {1} = N. There 
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are one-to-one correspondences between the set V x V of queries q expressed 
on the reference scales, the set {- 1 , 0 , 1 }^ of covectors q*, and the partitions of 
TV between positive q + , negative q~ and neutral q° arguments (see Example O- 
Let (x >? y) e V x V : 

• positive arguments : 

(x >? y) + = {i e N,Xi = T* and y* = ij = {i e TV, (x >? y)* = +1} 

• negative arguments : 

{x >? y)~ = {* € N,Xi = 1, and y* = Tj = {i e TV, (x >? y)* = -1} 

• neutral arguments : 

(x >? y)° = {* e N,Xi = yj = {ieN, (x >? y)* = 0 } 

Example 3 (Ex [TJ ctd.) For the statement (x £ y) introduced in Example 
GJ positive arguments (x > y) + = {Size, Cost }, negative arguments (x > y)~ = 
{Commute time, Sport club}, and there is no neutral argument. 

As a shortcut, when N + and N~ are two disjoint subsets of TV, we denote 
( N + > N~ ) the query for which positive and negative arguments are respectively 
N + and N~. Its associated covector is (7V + > TV - )* = T,^n+ d* - T,un~ d* ■ 

Dominance T> is characterized by statements without negative arguments 
(TV + > 0 ). Associated covectors are in {0,1}^ and can be broken down as a 
sum of elementary dominance covectors Y,isN+ d{ ■ 



Figure 5: Binary relation between criteria 

A preference swap of order 2 has exactly one positive argument i and one 
negative argument j : ({*} £ {j}). It expresses the preference of a raise of 
attribute i from the bottom to the top of the scale V), over a corresponding 
decrease of attribute j, or the acceptance to compensate a nominal decrease of 
attribute j with a nominal increase of attribute i. The set A 2 of preference swaps 
of order 2 can be viewed as a binary relation between criteria, and represented 
by a graph with nodes in TV : (i > j). This idea is illustrated on the Figure [5] 
and Example[4] 
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Example 4 (Ex[ 7J and Ex [H cirfj The necessary preference relation deduced 
from the preferential information given in Example [H contains the following 
preference swap of order 2, represented on Figure [31 

Thus, A 2 =■{{Commute time > Sport club), (Size > Sport club), (Cost > 
Commute time) , (Cost > Sport club), (Cost > Size)}. For instance, the abbre¬ 
viated statement (Cost > Sportclub), represented by the arrow from Cost to 
Sport club, means that an alternative ranking higher than D on attribute Cost 
and low on attribute Sportclub is necessarily preferred to one ranking low on 
Cost (between d and D) and high on Sportclub, attributes Commute time and 
Size being equal : (*, b, *,D) > (*,B, *, d) 

Generally, a statement ( N + > N~) is a preference swap of order \N + \ + |1V - |. 
When considering a query ( x >? y) and a criterion i for which preferences V 
only reference two values V. = {lj < Ti}, the rounding rules presented in section 
[3] boil down to 5 mutually exclusive cases for attributes Xi,yt : 

• i is a strong argument supporting x, as Xi > Ti > _U > Vi- Then, (x >■? y)* = 
+1 ; 

• i is a weak argument supporting x, as xi > yi but [lj, T,] $ [yi,x ;]. Then, 
(x >? y)* = 0 ; 

• i is a truly neutral argument, as Xi = yi. Then, (x >? y)* = 0 ; 

• i is a weak argument supporting y, as Tj > yi > Xi > lj. Then, ( x >? y)* = 

-i; 

• i is a strong argument supporting y, as either yi is greater than both Xi 
and Ti, or Xi is smaller than both yi and Li. Then, (x >? y) £ N and the 
covector (x >? y)* is not defined. 

5.3 Term by term explanations 

The following theorem reveals the core structure every explanation is built upon. 


Theorem 8 (Term by term explanation) 

Let a statement a e (V x V) nj\f, and <t + ,<t ~ be the positive and negative argu¬ 
ments of a respectively. The following propositions are equivalent : 

[(*)] 

1 . a e E 2 

2. 3 aeN*, 7 i,-, 7 9 e A 2 ,fi,-,4eN,mi,-,m n €N: 

aa* = E 47fe + E nikd(. 

k k 

3. There is a matching of cardinality |cr - ] in the graph of A 2 n (er + x er~). 
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4- There is an injection <f>: a -> a + such that Vfc e a , (|</>(fc)} >; {fc}). 

In a nutshell, an explanation is a sequence where, at each step, a positive argu¬ 
ment is used up to cancel an inferior negative argument, and, eventually, every 
negative argument has been cancelled. We highlight three consequences of this 
theorem : 

• If preferences only refer to swaps of order 2, then every necessary prefer¬ 
ence can be explained by swaps of order 2. 

The next result shows that Af = £ 2 - This is a potent existence result 
for explanations, and it provides a complete description of the necessary 
preference relation under the assumption of the decision maker expressing 
preferences between alternatives differing along two criteria only. Com¬ 
pared to theorem [6j we do not restrict ourself to preference statements in 
VxV. 

Corollary 1 (Existence of an explanation in £2 ) In the case of bi¬ 
nary reference scales and V £ A 2 then AT = £ 2 ; i.e: for any statement 
(x > y) e Af, there exists an explanation in £2 of (x > y). 

Proof. Let s = (x > y) e A f. By theorem [6j a := [x >■? y ] meets condition 
(ii). Then condition (i) of theorem [5] holds for a. As (x,x),(y,y) 6 D, we 
also have s e £ 2 . □ 


• Explanations can be kept short. The next corollary proves that the size of 
the explanation is at most n/2, which appears manageable for the recipient 
of explanation. 

Corollary 2 (Length of the explanation) For any statement (x > y) e 
Af, there exists an explanation with a length at most [^J + 2, where [mj 
denotes the integer part of m. 

• Building an explanation, or ensuring there is none, is handled by an effi¬ 
cient algorithm (see Algorithm Qp. A quick inspection of the complexity 
reveals that in the first part of the algorithm, there are at most 0(n 2 ) 
calls to a linear program (with n the number of criteria). This is followed 
by the resolution of a matching problem, which runs in its simpler version 
in 0(n 3 ). Note that in theory, the number of constraints and variables 
of the LP may be exponential in n, because of the number of preference 
statements can be. In practice, this is of course highly unrealistic as it is 
too demanding for the DM. And for a polynomially bounded number of 
preference queries, the algorithm is efficient. 
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Algorithm 1: FindExplanation 


Data: a statement a = (x > y) to be explained, a set of preference statements V. 
Result: a matching of each negative argument by a stronger positive one. 

1 Compute c7 + ,cr - 

2 if |cr + | < |cr~| then 

3 return None 

4 if cr £ Af then 

5 return None 


6 

7 

8 
9 

10 

11 

12 

13 

14 

15 


Build the graph of A2 n (cr + x a~) : 

Initialize Q as a graph with nodes cr + u a~ and no edge, 
for i e cr + do 

for j e <r~ do 

if the LP with |AT| + \P\ inequality constraints, |iV| equality constraints and 
|IV| + \V\ variables 
VpeV,£ p >0 
V/c e N , m/e > 0 

Vk e N , YipeV Pk + = 1 i/ k = i, -1 if k = j, 0 else, 

is feasible then 

add edge (i,j) to Q 


16 Find a matching of maximum cardinality C in bipartite graph Q. 

17 if C < |cr - | then 
is return None 

19 return (j) 


positive arguments 
[Cost) = 


(Size] = 


negative arguments 
=>( C ommute time] 


Sport clubj 


Figure 6: Matching returned by Algorithm [l] with data of Example [2] 


If the input statement (a; > y ) is in £ 2 , the algorithm returns a mapping 0 
matching each negative argument by a stronger positive argument. <j) is not an 
explanation per se. In order to provide a suitable sequence, the elementary links 
- swaps (( t>(k ) > k) k€ ( x > y y, and maybe some dominance relations - need to be 
sorted. Depending on the context of the decision, some policy concerning this 
sorting could prove more efficient than others, and this area of freedom left by 
the explanation engine should be investigated, theoretically and/or empirically, 
before any actual implementation. 

There is also some leeway concerning the values of the attributes appearing 
in the explanation sequence. Consider the following policies : 

• generate the shortest possible explanation, by grouping every dominance 
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relations into one single step, and providing a sequence where every term 
is described by attributes similar to either x or y. This is depicted in the 
sequence ©; 


x 1 = x 


x 3 (2/{d(ji),ji}’ x -{d(.h) , j i}) 


(3) 


xV (y{<i>(Ji)di, — ><l>(Jp)dp}’ X -{<l>(ji)dl,--;<l>(jp)dp}') 

x p+2 = y 


• generate an explanation one step longer, by having both an initial and final 
dominance steps, and providing a sequence where every term is described 
by attributes on the reference scales derived from V , yielding the following 
sequence: 


x 1 = x 

9 

X = X 



(4) 



The first policy seems a better fit for one-shot explanations, as it provides a 
shorter explanation looking a lot like the input statement. The second policy 
would be preferred for batch explanations, as all the explanations provided 
would share a common foundation of prototypical alternatives. 

Example 5 (E$l\ and Ex H ctd.) 

Figure [5] shows the bipartite graph of the relation A 2 restricted to pairs of 
positive-negative arguments. The double arrows highlight a matching of cardi¬ 
nality 2, covering the negative arguments, as returned by Algorithm 0: {(Cost > 
Commute time), (Size> Sportclub)}. Therefore, the statement (x > y ) can be 
explained by a sequence of preference swaps of order 2 and dominance relations. 
We propose 4 explanations, with x=(-45 min, no gym, 450 m 2 , -5000 €) and 
y=(-15 min, gym, 180 m 2 , -12500 €) : 

• x A 2 (-15 min, no gym, 450 m, -12500 €) A 2 y 

• x A 2 (-45 min, gym, 180 m 2 , -5000 €) A 2 y 


• x V ( a,b,C,D ) A 2 ( A,b,C,d ) A 2 ( A,B,c,d ) T> y 

• x V ( a,b,C,D ) A2 ( a,B,c,D ) A2 ( A,B,c,d ) V y 
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Example 6 (E^J\ and Ex H ctd.) 

From Figure{7\ one can readily see that (A,B,C,d) > ( a,b,c,D ) € JV. Let 
us now try to find an explanation in £ 2 - The positive and negative arguments 
are er + - {Cost} and a~ = {Commute time, Sport club, Size}. Considering /\ 2 
as described in Example [7J we can see that condition (Hi) of theorem\B\ is not 
satisfied, and thus (A, B,C,d ) > ( a,b, c, D ) e J\f \ S 2 . 

6 Conclusion 

Generating explanations to justify recommendation is a key challenge to decision- 
aiding systems. While we witness the emergence of highly sophisticated meth¬ 
ods to elicit preferences and compute recommended alternatives, the question 
of explanation is often neglected. We believe this may hinder the development 
of such systems. As a matter of fact, real decision makers prefer the use of a 
very basic model if its outcomes are transparent, rather that elaborate models 
that look as black box for them. They need explanation to accept the decision 
support recommendation. 

In this work we address this question in what is arguably the most basic 
model used in MCDM: a simple additive model, which is given here a robust in¬ 
terpretation when preferences are incomplete (as is usually the case in practice). 
Due to the presence of such imprecise utility functions, we propose an original 
approach where explanations are conceived as sequences of simple steps (simple 
comparisons of options). To be of practical interest, such sequences must exist, 
be of reasonable length, and each step should be easily checked by the DM. We 
have thus focused on these fundamental features in this paper. What we show 
in that respect is that the size of explanation depends on the number of values in 
the attribute space that are used in the preferential information. In particular, 
when there is no restriction on this number, the size of the explanations cannot 
be bounded. On the other hand, when the preferential information relies on two 
values per attribute only, we obtained a sharp bound on the size of a provably 
existing explanation. 

One question is whether our approach (in short: explanations conceived as 
sequences of simple steps) can be exported to other settings. While some general 
notions may be reused with different preference models, we emphasize that some 
key properties of the additive model (in particular the ability to work on pairs 
of criteria, thanks to separability) are exploited in this work. 

Still, there are many possible extensions to this work. We cite some of the 
most prominent here. 

Firstly, there remain theoretical questions to be studied. We have investi¬ 
gated two extreme cases: in the first one, no assumption is made on preferential 
information (yielding a negative result in terms of the length of the explanation), 
while in the second one we assume a binary reference scale (and can guarantee 
the existence of a short explanation). A natural but challenging question is 
whether the complexity of the reference scale can be more generally linked to 
the size of the explanations. 
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Secondly, we have provided an algorithm for the binary case only. It would be 
of practical interest to design and implement an algorithm finding the simplest 
(e.g. shortest) explanation in the general case. 

Thirdly, while we discuss good theoretical properties of explanations, an 
empirical validation remains to be conducted on other aspects mentioned (the 
sequencing of swaps, the choice of values, for instance). What makes the exercise 
difficult though is that this may highly depend on the context of use: a DM 
who needs to justify an important decision before a committee may not have 
the same expectations as a DM taking a decision for herself. 

Finally, the framework may be smoothly extended to cater for more gen¬ 
eral situations. For instance, the nature of the preferential information may be 
different. The DM may use a more expressive language, and give some state¬ 
ments on the intensity of their preferences. A first step in that direction is to 
assume a quaternary relation, of the form “oi is more intensely preferred to 02 
than 03 is preferred to 04 ”. While this would constitute a first step towards 
dealing with intensities, we are confident that this may still be handled within 
the framework described here. As a final suggestion on a possible extension of 
this framework, we note that this work makes the assumption that elicitation 
and explanation are dealt with separately. A certainly promising perspective is 
to extend the framework so that explanation and elicitation are actually inter¬ 
twined. By putting forward an explanation, the system shows some evidence 
which can in turn trigger some reaction from the DM. 
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A Proofs 


Proof. [Theorem [7] 

Let n = 3, p e N*. Assume that X\ 2 {0,1,2,...,2 p}, X 2 2 {-p,-p + 
1 , ..., - 1 , 0} and X 3 2 {- p , -p + 1 ,..., - 1 , 0). Consider the following preference 
information V: 


Vj e{0.P -1} 

(2j, ~j, ■) ~ (2j + 1 ,-j - 1, •) (5) 

Vj s {0,... ,p - 1} 

(2j + l,-,-j)>(2j + 2,.,-j-l) (6) 

Hence Vi = { 0 , 1, 2 ,.. ., 2p}, V 2 = {-p,-p + 1,..., — 1 , 0 } and V 3 = {-p,-p + 

1,...,-1,0). 

We set x = (0,0,0) and y = (2p, -p, -p). With this V, we clearly obtain the 
sequence 

z =(0,0,0) > (1,-1,0) (by©) 

> (2,-l,-l) > ••• (by©) 

fc(2p-2,-(p-l),-(p-l)) 

~ (2p - 1, —Pj -(p - 1)) (by©) 

^(2 P,~P,~p) = y (by©) 


so that x > y. This sequence is of length 2 p. 

There remains to prove that this is the shortest explanation. 

To this end, we first need to determine the form of A2. According to Theorem 
|TJ we need only to consider the elements inA2 that belong to Vi x V 2 x V3 (The 
other ones can be deduced by Pareto dominance). The preference information 
© and © is very special. In particular, any value k 6 V\ appears only in two 
examples - one in which k appears in the left hand side (in ©) and the other 
one where k appears in the right hand side (in ©). Moreover, we notice that, 
in © and ©, the value on the first attribute is always increasing from the 
left hand side to the right hand side, and the value of the second and the third 
attributes is decreasing from the left hand side to the right hand side. Hence the 
elements of A2 cannot be obtained by a combination of two or more preference 
information. They are obtained only from one preference information (© and 
©) and Pareto dominance V. More precisely, A2 is composed of the following 
preferences 

(*, J, k) Z 

where either there exists l such that i = 21 , j = 21 + 1, j > -l > -l - 1 > j' and 
k = k' , or there exists l such that i = 2Z + 1, j = 21 + 2 , j - j' and k> -l > -l- 1 > k'. 
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From this, one can readily see that the explanation of the preference of x over 
y described earlier is the shortest one. 

□ 


Proof. [ Theorem [ 8 ] : term-by-term explanations] 

We prove ( i ) => ( ii ) => (in) => (iv) => ( i ) : 

• ( i ) => (ii) : Suppose there is a sequence ( x 1 > x 2 ),..., (x r ~ 1 > x r ) of 
dominance relations and/or preference swaps of order 2 such that x 1 - x 
and x r = y. Without loss of generality, the terms of this sequence can 
be taken on the reference scales : x k € V (see remark [ 1 ] in section 01 
with k = 2). Chasles relation for covectors (th [ 2 ]) applied to this sequence 
yields Y,k( xk ~ x k+1 )* = (x 1 > x r )*, which in turn yields the relation 
sought with a = 1 after sorting the (x k > x k+l ) between preference swaps 
and dominance relations. Note that this covector transformation is a 
destructive compression, as it does not preserve the sequential structure 
of an explanation : there is no hope of restoring the original order of the 
swaps 7 *;. 

• (ii) => (in) : Suppose there exists integer coefficients a,£±, ■■■,£ q , mi, ■■■, m n 
and preference swaps of order 2 : 71 , ■■■, 7 q such that 

oct* = ^]47fe+ ( 7 ) 

k k 

Multiplying both sides of covector equation (0 by the vector ( 1 , —, 1 ), we 
obtain the relation : 


M := a(\a + \- \a |) = ^mfc^O 

To homogenize the right-hand side, we introduce a dummy criterion 0 
standing for dominance : dk - (k > 0) and denote N = N u {0}. Thus, 
relation Vu A 2 is a graph with nodes in N. Re-indexing coefficients Ik by 
the positive and negative arguments of swap 7 k (summing up duplicates 
if needed), and introducing £k ,0 : = Wfc : 

a cr* = Yj ( 8 ) 

(i > j)eI>nA2 

In order to complete the flow £, we introduce : 

— a source s supplying flow t s ^ = a to the positive arguments i e a + ; 

— a sink t collecting flow £ h t = a from the negative arguments j e a~, 
and £0 ,t ~ M from node 0 . 

Covector equation (JS]) ensures £ defines a feasible flow on the graph (N u 
{s, t},2?uA 2 u{s) xcr + ucr _ x{t}u{(0,t)}), without capacity constraints, as 
projection on the i th coordinate ensures flow conservation for node i € N. 
Flow £ can be decomposed as a superposition of : 
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— cycles, involving necessary equivalence between the nodes, and not 
contributing to the value of the flow; 

— paths from the source s to the sink t passing through node 0, denoting 
a dominance relation. Their total contribution to the value of the flow 
is M; 

— paths from the source s to the sink t not passing through node 0 , with 
an overall contribution of ax|cr~| to the value of the flow. Each of these 
paths links a positive argument i\ e <j + to a negative argument i r e a~ 
through necessary preference swaps of order 2. Transitivity of the 
necessary preference relation entails that i\ is necessarily preferred 
to i r : the edge (ii,i r ) belongs to A 2 n ( a + x a~). 

We reduce flow t by ignoring the cycles and paths passing through node 0. 
Also, the flow a carried by the path from source to sink s -*■ i\ > i 2 £ ... > 
i r -► t is redirected to edge (ii,i r ). As a result, we obtain a flow of value 
a|cr"| on the graph of the relation A 2 restricted to a + x a~. This entails 
the existence of a matching of cardinality \cr~\ in this graph, obtained by 
setting an upper capacity constraint of value 1 on each edge leaving the 
source s and entering the sink t (as a cut of capacity C on the network 
with capacity constraints Cjj e {l,oo} is a cut of capacity ax C on the 
same network with capacity constraints a x Cij). 

• (Hi) => ( iv ) is simply a rewording. 

• (iv) => (i) : Let <f> : a~ -*■ cr + , injective, such that Vk e a~, ({<j>(k)} £ {k}). 
In subset form, the statement cr expressing the necessary preference of 
alternative xeF over alternative y 6 V can be written (a + > cr - ). Given 
any ordering n of the negative argument set er~, we can build a sequence 
of alternatives of decreasing preference x\'= x> ...> xi CT -| +1 e V such that 
the k th statement (xk Z Xk+i) matches the preference swap (<j>(nk) Z 7 r^). 
Thus, the sequence of sets (xk Z y)~ decreases from a~ to 0 , one element 
at a time, and the sequence of sets (xk Z y) + also decreases from cr + to 
a + \0[cr _ ], one element at a time. If the set a + \0[<r~] is empty, X\ a -j +1 = y, 
and the sequence x = x 1 > ... > x ^ l +1 = y is an explanation of (x > y) 
by preference swaps of order 2, of length |cr _ | + 1. Else, x^ a l +1 + y but 
( x l +1 >? y) is a dominance statement, as its negative argument set is 
empty. Thus, the sequence x = x 1 > ... > x^ a ' +1 > y is an explanation of 
(x £ y) by preference swaps of order 2 and a dominance relation, of length 
1(7 1 + 2 . 


□ 
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